Quantification of Portrayal Concepts using tf-idf Weighting
نویسندگان
چکیده
منابع مشابه
Improving Native Language Identification with TF-IDF Weighting
This paper presents a Native Language Identification (NLI) system based on TF-IDF weighting schemes and using linear classifiers support vector machines, logistic regressions and perceptrons. The system was one of the participants of the 2013 NLI Shared Task in the closed-training track, achieving 0.814 overall accuracy for a set of 11 native languages. This accuracy was only 2.2 percentage poi...
متن کاملNear Duplicate Image Detection: min-Hash and tf-idf Weighting
This paper proposes two novel image similarity measures for fast indexing via locality sensitive hashing. The similarity measures are applied and evaluated in the context of near duplicate image detection. The proposed method uses a visual vocabulary of vector quantized local feature descriptors (SIFT) and for retrieval exploits enhanced min-Hash techniques. Standard min-Hash uses an approximat...
متن کاملClustering scRNA-Seq Data using TF-IDF
In this abstract, we propose several computational approaches for clustering scRNA-Seq data based on the Term Frequency Inverse Document Frequency (TF-IDF) transformation that has been successfully used in the field of text analysis. Empirical evaluation on simulated cell mixtures with different levels of complexity suggests that the TF-IDF methods consistently outperform existing scRNA-Seq clu...
متن کاملUsing tf-idf as an edge weighting scheme in user-object bipartite networks
Bipartite user-object networks are becoming increasingly popular in representing user interaction data in a web or e-commerce environment. They have certain characteristics and challenges that differentiates them from other bipartite networks. This paper analyzes the properties of five real world user-object networks. In all cases we found a heavy tail object degree distribution with popular ob...
متن کاملBag of Works Retrieval: TF*IDF Weighting of Co-cited Works
Although it is not presently possible in any system, the style of retrieval described here combines familiar components—co-citation linkages of documents and TF*IDF weighting of terms—in a novel way that could be implemented in citation-enhanced digital libraries of the future. Rather than entering keywords, the user enters a string identifying a work, called a seed, to retrieve the strings ide...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Information Sciences and Techniques
سال: 2013
ISSN: 2319-409X,2249-1139
DOI: 10.5121/ijist.2013.3501